Potential-based multiobjective reinforcement learning approaches to low-impact agents for AI safety

نویسندگان

چکیده

The concept of impact-minimisation has previously been proposed as an approach to addressing the safety concerns that can arise from utility-maximising agents. An impact-minimising agent takes into account potential impact its actions on state environment when selecting actions, so avoid unacceptable side-effects. This paper proposes and empirically evaluates implementation within framework multiobjective reinforcement learning. key contributions are a novel potential-based specifying measure impact, examination variety non-linear action-selection operators achieve acceptable trade-off between achieving agent’s primary task minimising environmental impact. These experiments also highlight unreported issue with noisy estimates for agents using action-selection, which broader implications application

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents

This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...

متن کامل

Pareto-Based Multiobjective AI Planning

Real-world problems generally involve several antagonistic objectives, like quality and cost for design problems, or makespan and cost for planning problems. The only approaches to multiobjective AI Planning rely on metrics, that can incorporate several objectives in some linear combinations, and metric sensitive planners, that are able to give different plans for different metrics, and hence t...

متن کامل

The Potential of Reinforcement Learning for Live Musical Agents

Reinforcement learning has great potential applicability in computer music, particularly for interactive scenarios and the production of effective control policies for systems. This paper considers in particular the case of interactive music systems, where a software agent can be trained online during a rehearsal session with a musician. A small-scale system is described which uses the Sarsa(λ)...

متن کامل

Learning to Teach Reinforcement Learning Agents

In this article we study the transfer learning model of action advice under a budget. We focus on reinforcement learning teachers providing action advice to heterogeneous students playing the game of Pac-Man under a limited advice budget. First, we examine several critical factors affecting advice quality in this setting, such as the average performance of the teacher, its variance and the impo...

متن کامل

hierarchical functional concepts for knowledge transfer among reinforcement learning agents

this article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for reinforcement learning agents. these definitions are used as a tool of knowledge transfer among agents. the agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. in other words, the agents are assumed t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Engineering Applications of Artificial Intelligence

سال: 2021

ISSN: ['1873-6769', '0952-1976']

DOI: https://doi.org/10.1016/j.engappai.2021.104186